Hindsight Optimization for Probabilistic Planning with Factored Actions
Abstract
Inspired by the success of the satisfiability approach to deterministic planning, we propose a novel framework for on-line stochastic planning that embeds the idea of hindsight optimization into a reduction to integer linear programming. In contrast to previous work using reductions or hindsight optimization, our formulation is general purpose: it works with domain specifications over factored state and action spaces, and as a result it is also scalable, in principle, to exponentially large action spaces. Our approach is competitive with state-of-the-art stochastic planners on challenging benchmark problems, and sometimes exceeds their performance, especially in large action spaces.
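As a rough illustration of the hindsight optimization idea named in the abstract, the sketch below samples a set of futures, treats each one as a deterministic problem, and scores every candidate first action by the average value of the best plan in hindsight. It is not the paper's formulation: the paper reduces the problem to integer linear programming, whereas this sketch substitutes brute-force enumeration of plans, which is feasible only for tiny problems. The function `hindsight_action`, the toy `step` and `reward` functions, and all parameter values are hypothetical and chosen only for illustration.

```python
import itertools
import random


def hindsight_action(state, actions, step, reward, horizon, num_futures, seed=0):
    """Choose a first action by hindsight optimization (brute-force sketch)."""
    rng = random.Random(seed)
    # Each sampled future fixes one random number per time step, which turns
    # the stochastic problem into a deterministic one.
    futures = [[rng.random() for _ in range(horizon)] for _ in range(num_futures)]

    def rollout(first, rest, noise):
        # Total reward of a fixed plan under one fixed future.
        s, total = state, 0.0
        for a, u in zip((first,) + rest, noise):
            s = step(s, a, u)
            total += reward(s)
        return total

    def q_estimate(first):
        # Average over futures of the best plan that starts with `first`.
        # Assumption: plans are enumerated exhaustively here instead of
        # being found by an ILP solver.
        plans = list(itertools.product(actions, repeat=horizon - 1))
        return sum(
            max(rollout(first, rest, noise) for rest in plans)
            for noise in futures
        ) / num_futures

    return max(actions, key=q_estimate)


# Hypothetical toy domain: walk along positions 0..4, goal at 4.
def step(s, a, u):
    # The move "slips" (state unchanged) whenever the sampled number is below 0.2.
    return s if u < 0.2 else max(0, min(4, s + a))


def reward(s):
    return 1.0 if s == 4 else 0.0


best = hindsight_action(0, actions=(1, 0, -1), step=step, reward=reward,
                        horizon=5, num_futures=20)
print(best)
```

Because each future is optimized separately, the estimate is optimistic: it assumes the planner knows how the randomness will resolve. Enumeration also grows exponentially with the horizon and the number of actions, which is the kind of blow-up a compact optimization encoding over factored actions is meant to avoid.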
Similar resources
Probabilistic Planning via Determinization in Hindsight
This paper investigates hindsight optimization as an approach for leveraging the significant advances in deterministic planning for action selection in probabilistic domains. Hindsight optimization is an online technique that evaluates the one-step-reachable states by sampling future outcomes to generate multiple non-stationary deterministic planning problems which can then be solved using searc...
POND-Hindsight: Applying Hindsight Optimization to POMDPs
We present the POND-Hindsight entry in the POMDP track of the 2011 IPPC. Similar to successful past entrants (such as FF-Replan and FF-Hindsight) in the MDP tracks of the IPPC, we sample action observations (similar to how FF-Replan samples action outcomes) and guide the construction of policy trajectories with a conformant (as opposed to classical) planning heuristic. We employ a number of tech...
Anticipatory On-Line Planning
We consider the problem of on-line continual planning, in which additional goals may arrive while plans for previous goals are still executing and plan quality depends on how quickly goals are achieved. This is a challenging problem even in domains with deterministic actions. One common and straightforward approach is reactive planning, in which plans are synthesized when a new goal arrives. In...
Planning Under Temporal Uncertainty Using Hindsight Optimization
A robot task planner must be able to tolerate uncertainty in the durations of commanded actions and uncertainty in the time of occurrence of exogenous events. Sophisticated temporal reasoning techniques have been proposed to deal with such issues, although few existing planners support them. In this paper, we demonstrate the capabilities of a much simpler technique, hindsight optimization, in w...
Improving Determinization in Hindsight for On-line Probabilistic Planning
Recently, ‘determinization in hindsight’ has enjoyed surprising success in on-line probabilistic planning. This technique evaluates the actions available in the current state by using non-probabilistic planning in deterministic approximations of the original domain. Although the approach has proven itself effective in many challenging domains, it is computationally very expensive. In this paper...
Publication date: 2015